Model Optimization, Inference Engines, LLM Quantization, Privacy-focused Deployments

Feeds to Scour
SubscribedAll
Scoured 7666 posts in 166.2 ms
Decoupling the AI Stack: How to Architect a Production-Grade Local LLM System
dev.to·4h·
Discuss: DEV
🏗️AI Infrastructure
Preview
Report Post
Search over Self-Edit Strategies for LLM Adaptation
arxiv.org·3h
Incremental Computation
Preview
Report Post
MLSN #18: Adversarial Diffusion, Activation Oracles, Weird Generalization
lesswrong.com·1d
💻Local LLMs
Preview
Report Post
What AI Accountability Looks Like (I Built It)
forgeforward.substack.com·12h·
Discuss: Substack
🤖AI Coding Tools
Preview
Report Post
Co-optimization Approaches For Reliable and Efficient AI Acceleration (Peking University et al.)
semiengineering.com·14h
Hardware Acceleration
Preview
Report Post
Guardrails for trust, safety, and ethical development and deployment of Large Language Models (LLM)
arxiv.org·3h
💻Local LLMs
Preview
Report Post
AI Systems Performance Engineering
github.com·7h·
Discuss: Hacker News
🏗️AI Infrastructure
Preview
Report Post
Privacy-Preserving Active Learning for heritage language revitalization programs with zero-trust governance guarantees
dev.to·22h·
Discuss: DEV
💻Local LLMs
Preview
Report Post
I replaced my ChatGPT subscription with a 12GB GPU and never looked back
xda-developers.com·11h
Hardware Acceleration
Preview
Report Post
MIT’s new ‘recursive’ framework lets LLMs process 10 million tokens without context rot
venturebeat.com·1d·
🏗️AI Infrastructure
Preview
Report Post
The Democratization of AI
build.ms·1d
🤖Anthropic Claude
Preview
Report Post
Edge AI: The future of AI inference is smarter local compute
infoworld.com·2d
📱Edge AI
Preview
Report Post
Privacy-first AI art, zero data stored
redhorseoracle.com·14h·
Discuss: Hacker News
🔐Decentralized Identity
Preview
Report Post
Using Local LLMs to Discover High-Performance Algorithms
towardsdatascience.com·2d
⚙️LLVM
Preview
Report Post
Learning from Models
rodney.bearblog.dev·1d
🤖Reinforcement Learning
Preview
Report Post
Navigating AI Entrepreneurship: Insights From The Application Layer
kdnuggets.com·17h
🤖AI Coding Tools
Preview
Report Post
Show HN: Autonoma – Air-Gapped AI Code Engineer (L5 Autonomy)
vihaaninnovations.github.io·1d·
Discuss: Hacker News
🤖AI Coding Tools
Preview
Report Post
Everything Moe
ianbarber.blog·1d·
Discuss: Hacker News
📱Edge AI
Preview
Report Post
Elysia.js: Type-Safe APIs Without the Mess
spin.atomicobject.com·1d
📘TypeScript
Preview
Report Post
Why AI Needs GPUs and TPUs: The Hardware Behind LLMs
blog.bytebytego.com·2d
Hardware Acceleration
Preview
Report Post

Keyboard Shortcuts

Navigation
Next / previous item
j/k
Open post
oorEnter
Preview post
v
Post Actions
Love post
a
Like post
l
Dislike post
d
Undo reaction
u
Recommendations
Add interest / feed
Enter
Not interested
x
Go to
Home
gh
Interests
gi
Feeds
gf
Likes
gl
History
gy
Changelog
gc
Settings
gs
Browse
gb
Search
/
General
Show this help
?
Submit feedback
!
Close modal / unfocus
Esc

Press ? anytime to show this help